Today: Maximum Likelihood
Goal: Estimate distribution parameters from observed data
Objectives:
- derive maximum likelihood estimate for the exponential distribution
- derive maximum likelihood estimate for the Poisson distribution
Notation
Recall,
- Lower-case \(\{x_{1}, x_{2}, x_{3}, ..., x_{n}\}\) is a set of observations
- Upper-case \(\{X_{1}, X_{2}, X_{3}, ..., X_{n}\}\) is a set of random variables (i.e. a data set)
- Treating \(\{X_{1}, X_{2}, ..., X_{n}\}\) as a set of \(n\) i.i.d. (independent and identically distributed) random variables is a common assumption.
- With independence, \[P(X_{1}, X_{2}, ..., X_{n}) = P(X_{1}) \cdot P(X_{2}) \cdot ... \cdot P(X_{n})\]
- Each individual probability is computed (at least theoretically) with a PDF (probability density function) \[P(x_{i}) = f_{X}(x_{i})\]
Likelihood
::::: {.panel-tabset}
Definition
Let the likelihood function, in terms of a parameter \(\theta\) (on which each density \(f_{X}\) depends), be the joint probability
\[L(\theta) = P(X_{1} = x_{1}, X_{2} = x_{2}, ..., X_{n} = x_{n}) = f_{X}(x_{1}) \cdot f_{X}(x_{2}) \cdots f_{X}(x_{n})\]
or
\[L\left(\theta; \left\{x_{i}\right\}_{i=1}^{n}\right) = \displaystyle\prod_{i = 1}^{n} f_{X}(x_{i})\]
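For example, if the data are modeled with an \(\text{Exp}(\lambda)\) density \(f_{X}(x) = \lambda e^{-\lambda x}\) (as in the first exercise below), then
\[L\left(\lambda; \left\{x_{i}\right\}_{i=1}^{n}\right) = \displaystyle\prod_{i = 1}^{n} \lambda e^{-\lambda x_{i}} = \lambda^{n} e^{-\lambda \sum_{i=1}^{n} x_{i}}\]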
Maximum Likelihood
Given a data set \(\{x_{1}, x_{2}, ..., x_{n}\}\), we seek the parameter value(s) \(\theta\) that make realizing the data set most likely.
\[L\left(\theta; \left\{x_{i}\right\}_{i=1}^{n}\right) = \displaystyle\prod_{i = 1}^{n} f_{X}(x_{i})\]
From calculus, recall that the main step in maximizing a function is setting its first derivative equal to zero.

:::::
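Since the likelihood is a product, it is usually easier to work with the log-likelihood \(\ell(\theta) = \ln L(\theta)\); because \(\ln\) is an increasing function, \(L\) and \(\ell\) are maximized at the same \(\theta\):
\[\ell(\theta) = \displaystyle\sum_{i = 1}^{n} \ln f_{X}(x_{i}), \qquad \frac{d\ell}{d\theta} = 0\]
This step is used in both derivations below.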

Given a data set \(\{x_{1}, x_{2}, ..., x_{n}\}\) of light-bulb lifetimes, assume an \(\text{Exp}(\lambda)\) distribution (a derivation sketch follows the list).
- Compute the value of rate parameter \(\lambda\) that maximizes the likelihood of the data set.
- Compute the likelihood at the maximum likelihood estimate (MLE).
- Characterize the top 5 percent of light bulbs.
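One possible sketch, using the \(\text{Exp}(\lambda)\) likelihood computed above: take the log of \(L(\lambda) = \lambda^{n} e^{-\lambda \sum x_{i}}\) and set the derivative to zero.
\[\ell(\lambda) = n \ln \lambda - \lambda \displaystyle\sum_{i = 1}^{n} x_{i}, \qquad \frac{d\ell}{d\lambda} = \frac{n}{\lambda} - \displaystyle\sum_{i = 1}^{n} x_{i} = 0 \quad\Rightarrow\quad \hat{\lambda} = \frac{n}{\sum_{i=1}^{n} x_{i}} = \frac{1}{\bar{x}}\]
(The second derivative \(-n/\lambda^{2} < 0\) confirms this critical point is a maximum.) Substituting back, since \(\hat{\lambda} \sum_{i} x_{i} = n\), the likelihood at the MLE is
\[L(\hat{\lambda}) = \hat{\lambda}^{n} e^{-n}\]
For the last part, the top 5 percent of bulbs are those lasting beyond the 95th percentile \(t\), where
\[P(X > t) = e^{-\hat{\lambda} t} = 0.05 \quad\Rightarrow\quad t = \frac{\ln 20}{\hat{\lambda}}\]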
Given a data set \(\{x_{1}, x_{2}, ..., x_{n}\}\), assume a \(\text{Pois}(\lambda)\) distribution. Compute the value of the parameter \(\lambda\) that maximizes the likelihood of the data set.
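A similar sketch for the Poisson model, where \(f_{X}(x) = \dfrac{\lambda^{x} e^{-\lambda}}{x!}\) is now a probability mass function:
\[\ell(\lambda) = \displaystyle\sum_{i = 1}^{n} \ln \frac{\lambda^{x_{i}} e^{-\lambda}}{x_{i}!} = \left(\displaystyle\sum_{i = 1}^{n} x_{i}\right) \ln \lambda - n\lambda - \displaystyle\sum_{i = 1}^{n} \ln(x_{i}!)\]
\[\frac{d\ell}{d\lambda} = \frac{\sum_{i} x_{i}}{\lambda} - n = 0 \quad\Rightarrow\quad \hat{\lambda} = \frac{1}{n}\displaystyle\sum_{i = 1}^{n} x_{i} = \bar{x}\]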
Estimators Revisited
If we sample from a theoretical \(U(0, M)\) distribution, the sample maximum \(s_{M}\) of each sample is less than or equal to \(M\)
\[s_{M} \leq M\]
It follows that the average of the sample maxima underestimates the true maximum
\[\text{E}[s_{M}] \leq M\]
Therefore the sample maximum is a biased estimator of the true maximum.
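In fact the bias can be quantified exactly: for \(n\) i.i.d. samples from \(U(0, M)\), a standard order-statistics computation gives
\[\text{E}[s_{M}] = \frac{n}{n+1} M < M\]
so rescaling by \(\frac{n+1}{n}\) yields an unbiased estimator \(\hat{M} = \frac{n+1}{n} s_{M}\).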
\[~\]
Similarly, the sample minimum \(s_{m}\) from a \(U(m, 0)\) distribution overestimates the true minimum
\[\text{E}[s_{m}] \geq m\]
Therefore the sample minimum is a biased estimator of the true minimum.
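By the same computation, for \(n\) i.i.d. samples from \(U(m, 0)\) (with \(m \leq 0\)),
\[\text{E}[s_{m}] = \frac{n}{n+1} m \geq m\]
and \(\hat{m} = \frac{n+1}{n} s_{m}\) is unbiased.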
Looking Ahead
Final Exam will be on May 6
